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Abstract 


Searches for supersymmetry (SUSY) are performed using a sample of hadronic events 
produced in 8 TeV pp collisions at the CERN LHC. The searches are based on the Mj 2 
variable, which is a measure of the transverse momentum imbalance in an event. The 
data were collected with the CMS detector and correspond to an integrated luminos¬ 
ity of 19.5 ftU 1 . Two related searches are performed. The first is an inclusive search 
based on signal regions defined by the value of the Mt 2 variable, the hadronic energy 
in the event, the jet multiplicity, and the number of jets identified as originating from 
bottom quarks. The second is a search for a mass peak corresponding to a Higgs bo¬ 
son decaying to a bottom quark-antiquark pair, where the Higgs boson is produced as 
a decay product of a SUSY particle. For both searches, the principal backgrounds are 
evaluated with data control samples. No significant excess over the expected number 
of background events is observed, and exclusion limits on various SUSY models are 
derived. 
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1 Introduction 


Searches for physics beyond the standard model (SM) based on final states with jets and large 
values of transverse momentum imbalance p| 11iss are sensitive to a broad class of new-physics 
models. Here, we report the results of such searches based on the Mj 2 variable |[TJ. The Mj 2 
variable characterizes p™ iss in events with two pair-produced heavy particles, each of which 
decays to at least one undetected particle, leading to p| lliss . An example is supersymmetry 
(SUSY) with R-parity conservation |2|, in which pair-produced SUSY particles each decay to 
SM particles and to a massive, neutral, weakly interacting lightest SUSY particle (LSP), which 
escapes without detection. The value of Mj 2 reflects the masses of the pair-produced particles, 
which are much lighter for SM background processes than expected for SUSY particles such 
as squarks and gluinos. The Mj 2 variable was previously used for top-quark mass measure¬ 
ments by the CDF and CMS experiments [|3]ID, and for SUSY searches by the CMS |5]|63 and 
ATLAS l7l4T3l experiments. 

This paper describes searches for physics beyond the SM performed using a data sample of 
pp collisions collected in 2012 at a centre-of-mass energy of 8 TeV with the CMS detector at the 
CERN LHC. The size of the sample, measured by its integrated luminosity, is 19.5 fb 1 . 

Two different Mi 2 -based searches are presented. The first search, called the inclusive-Mi 2 
search, employs several signal regions defined by the number of jets (Nj), the number of tagged 
bottom-quark jets (Nb), the value of Mx 2 , and the hadronic energy in an event. This general 
search aims to cover a large variety of SUSY and other new-physics signatures. The second 
search, called the Mx 2 -Higgs search, is a specialized analysis targeting events with a Higgs bo¬ 
son produced in the decay of a heavy SUSY particle. The SM Higgs boson decays primarily 
to a bottom quark-antiquark (bb) pair. For a large variety of SUSY models, the lightest Higgs 
boson (h boson) has SM properties, especially if the masses of all other SUSY Higgs bosons are 
much larger. In the Mx 2 -Higgs search, we therefore search for an excess of events at the SM 
Higgs boson mass of 125 GeV in the invariant mass distribution of b-tagged jet pairs. 


The two searches rely on similar selection criteria for the Mx 2 variable to enhance the sensitivity 
to a potential SUSY signal and to reduce the background from SM multijet events to a minimal 
level. The remaining SM background consists mostly of Z+jets events where the Z boson decays 
to neutrinos, and W+jets and tt +jets events where one W boson decays leptonically. These 
backgrounds are mostly estimated by methods using data. 


This analysis extends a previous CMS publication |5|, based on pp collisions at 7 TeV, by ex¬ 
ploiting a higher collision energy and a larger data sample. Alternative inclusive searches in 
hadronic final states based on the 8 TeV data sample are presented in Refs. lfl4Hl8l . 


This paper is organized as follows. In Sectio n |2j the Mx 2 variable is defined. A description 
of the detector and trigger is given in Section|3| The data sets and the general event selection 
procedures are discussed in Section^ Section|5]presents the analysis strategy for the inclusive- 
Mx 2 and Mx 2 -Higgs searches, and Section[6]the background estimation method based on data 
control samples. A comparison between the observed numbers of events and the predicted 
background yields is presented for the two searches in Section [7] Systematic uncertainties are 
discussed in Section [8] The statistical procedures used to calculate exclusion limits on SUSY 
particles are presented in Section [9j with the limits themselves presented in Section 10 Sec¬ 
tion HT] contains a summary. 
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3 Detector and trigger 


2 Definition of the M T2 variable and interpretation 

The use of Mj 2 as a search variable is discussed in our previous publication |5J. Here, we 
recapitulate the most salient aspects. The kinematic mass variable Mj 2 was introduced as a 
means to measure the mass of pair-produced particles in situations where both particles decay 
to a final state containing an undetected particle X of mass nix■ For each decay chain, the visi¬ 
ble system is defined by the transverse momentum transverse energy Ej 8 ' 1 ' 1 , and mass 

m vis (o (z = 1, 2) obtained by summing the four-momenta of all detected particles in the decay 
chain. The two visible systems are accompanied by the two undetected particles with unknown 
transverse momenta p T . In analogy with the transverse mass used for the W boson mass de¬ 
termination 1191 , two transverse masses are defined for the two pair-produced particles: 


( Mj } ) 2 = ( m vis( ')) 2 


+ m \ 


pvis(z') pX(z') 

ij'j’ L - ,r J n 


->vis(z) ->X(z) 

Ft -Ft 


( 1 ) 


If the correct values of nix and m v ' sll> , and Py lsl,) are chosen, the transverse masses M-p 

x(i) 

do not exceed the mass of the parent particles. The momenta p T y of the unseen particles, 
however, are not experimentally accessible individually. Only their sum, the missing trans¬ 
verse momentum p| 11iss , is known. A generalization of the transverse mass, the My 2 variable, is 
defined as: 


{nix) — 


min 


Ft +Ft -Ft” 


max 


( 2 ) 


where the unknown mass nix is a free parameter. The minimization is performed over trial 
momenta of the undetected particles fulfilling the p™' ss constraint. 


In this analysis, all visible objects, such as jets, are clustered into two pseudojets. For this 
purpose, we use the hemisphere algorithm defined in Section 13.4 of Ref. 11201 . The algorithm 
is seeded by the two jets with largest dijet invariant mass. The clustering is performed by 
minimizing the Lund distance measure ll2Tl[22l . Standard model multijet events, interpreted 
as two pseudojets, may give rise to large Mj 2 if both pseudojets have large masses. Setting 
m vls (') = 0 in Eq. <0 suppresses the multijet contributions without affecting signal sensitivity, 
since the kinematic terms of Eq. (|T[) are large for most new-physics scenarios. In the following, 
Mj 2 is computed using (z = 1,2), and p™ iss , setting both m vls F) terms in Eq. (JTJ) to 

zero. 


Although most the background from SM multijet events is thus characterized by small values 
of Mj 2 / a residual background at large Mj 2 arises from multijet events in which the two pseu¬ 
dojets are not back-to-back because of jet energy mismeasurements. Further selection criteria 
are applied to suppress these events, as discussed in Section [4] 


3 Detector and trigger 

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diame¬ 
ter, providing a magnetic field of 3.8 T. Within the superconducting solenoid volume are a sil¬ 
icon pixel and strip tracker, a lead-tungstate crystal electromagnetic calorimeter, and a brass/ 
scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Muons 
are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the 
solenoid. Extensive forward calorimetry complements the coverage provided by the barrel 
and endcap detectors. The detector is nearly hermetic, covering 0 < (p < 2n in azimuth, and 
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thus allows the measurement of momentum balance in the plane transverse to the beam direc¬ 
tion. The first level of the CMS trigger system, composed of custom hardware processors, uses 
information from the calorimeters and muon detectors to select the most interesting events in 
a fixed time interval of less than 4 jis. The high level trigger processor farm further decreases 
the event rate, from around 100 kHz to around 300 Hz, before data storage. A more detailed 
description of the CMS detector, together with a definition of the coordinate system used and 
the relevant kinematic variables, can be found in Ref. |[23l . 

Events are selected using three complementary triggers. A trigger based on the scalar sum of jet 
px values (Hj) requires Hj > 650 GeV. A second trigger requires E™ 1SS > 150 GeV, where E™ ISS 
is the magnitude of pj™ ss . A third trigger requires Hj > 350 GeV and E™ 1SS > 100 GeV. The 
trigger efficiency is measured to be larger than 99% for events that satisfy the event selection 
criteria outlined in Section |H 


4 Data sets and event selection 

The event selection is designed using simulated samples of background and signal processes. 
Background events are generated with the MadGraph 5 It24l , PYTHIA 6.4.26 l22l , and POWHEG 
1.0 l25l programs. Signal event samples based on simplified model scenarios (SMS) l26l are 
generated using the MadGraph 5 program, with the decay branching fractions of SUSY par¬ 
ticles set either to 0% or 100% depending on the SUSY scenario under consideration. We also 
generate signal events in the context of the constrained minimal supersymmetric SM (cMSSM/ 
mSUGRA) G7I . The cMSSM/ mSUGRA events are generated using the PYTHIA program, with 
the SDECAY 11281 program used to describe the SUSY particle decay branching fractions and the 
SOFTSUSY 11291 program to calculate the SUSY particle mass spectrum. The PYTHIA program is 
used to describe the parton shower and hadronization. While all generated background sam¬ 
ples are processed with the detailed simulation of the CMS detector response, based on GEANT 
4 1301 . for signal samples the detector simulation is performed using the CMS fast simulation 
package lf3Tj| . Detailed cross checks are conducted to ensure that the results obtained with fast 
simulation are in agreement with the ones obtained with GEANT-based detector simulation. For 
SM backgrounds, the most accurate calculations of the cross sections available in the literature 
are used 132} 33]. These are usually at next-to-leading order (NLO) in «§. For the SUSY signal 
samples, cross sections are calculated at NLO H34h38] using the PROSPINO 2.1 f39l program. 

The data and simulated events are reconstructed and analyzed in an identical manner. The 
event reconstruction is based on the particle-flow (PF) algorithm |4D} [411 , which reconstructs 
and identifies charged hadrons, neutral hadrons, photons, muons, and electrons. Electrons and 
muons are required to have transverse momentum pj > 10 GeV and pseudorapidity \r]\ < 2.4. 
For electrons, the transition region between barrel and endcaps (1.442 < \rj\ < 1.566) is ex¬ 
cluded because the electron reconstruction in this region is not optimal. An isolation require¬ 
ment is also employed, requiring that the pj sum of photons, charged hadrons, and neutral 
hadrons, in a cone of A R = V(A//) 2 + (A cp) 2 = 0.3 along the lepton direction, divided by the 
lepton pj value, be less than 0.15 for electrons and 0.20 for muons. The isolation value is cor¬ 
rected for the effects of pileup, that is, multiple pp collisions within the same bunch crossing 
as the primary interaction. The electron and muon reconstruction and identification criteria 
are described in Refs. l42l and Il43l . respectively. All particles, except the isolated electrons and 
muons, are clustered into PF jets Il44ll using the anti -kj jet-clustering algorithm Ii45l with a size 
parameter of 0.5. The jet energy is calibrated by applying correction factors as a function of the 
pj and the rj of the jet l44l . The effect of pileup on jet energies is treated as follows: tracks not 
associated with the primary interaction are removed from the jet; for the neutral part of the jet. 
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5 Search strategy 


the effect of pileup is reduced using the FastJet pileup subtraction procedure [46} 073 . All jets 
are required to satisfy basic quality criteria (jet ID 1481 ), which eliminate, for example, spurious 
events due to calorimeter noise. Jets are also required to have pj > 20GeV and j ?/1 < 2.4. Jets 
are b-tagged using the medium working point of the combined secondary vertex (CSV) algo¬ 
rithm [49| . Tau leptons are reconstructed in their decays to one or three charged particles |5Q| 
and are required to have pj > 20GeV and \rj\ < 2.3. The r leptons are also required to sat¬ 
isfy a loose isolation selection: the px~sum of charged hadrons and photons that appear within 
A R < 0.5 of the candidate T-lepton direction is required to be less than 2 GeV after subtraction 
of the pileup contribution. Throughout this paper, any mention of a r lepton refers to its recon¬ 
structed hadronic decay Photons Il5l1 are required to have pj > 20GeV, \rj\ < 2.4, and to not 
appear in the transition region between the barrel and endcap detectors. Photons are further 
required to satisfy selection criteria based on the shape of their calorimetric shower, to deposit 
little energy in the hadron calorimeter, and to fulfill isolation requirements. 

The missing transverse momentum vector p| 11iss is defined as the projection onto the plane 
perpendicular to the beam axis of the negative vector sum of the momenta of all reconstructed 
particles in the event. Its magnitude is referred to as E™ 1SS . The hadronic activity in the event, 
Hj, is defined to be the scalar pj sum of all accepted jets with pj > 50 GeV and \rj\ <3.0. Events 
selected with the pure-Hj trigger described in Section [3] are required to satisfy Hj > 750 GeV. 
Events selected with one of the two other triggers are required to satisfy Hj > 450 GeV and 
E™ iss > 200 GeV. 

Corrections for differences observed between the simulation and data due to the jet energy 
scale mi, the b-tagging efficiencies l49l . and the pj spectrum of the system recoil B52l are 
applied to simulated events. 

Events are required to contain at least two jets that, in addition to the previous general jet re¬ 
quirements, have px > 100 GeV. To reduce the background from events with W(fV)+jets and 
top-quark production, events are rejected if they contain an isolated electron, muon, or r lepton. 
Background from multijet events, which mostly arises because of jet energy misreconstruction, 
is reduced by requiring the minimum difference A<J> m i n in azimuthal angle between the pj' ss 
vector and one of the four jets with highest pj to exceed 0.3 radians. To reject events in which 
Ej' ss arises from unclustered energy or from jets aligned near the beam axis, a maximum dif¬ 
ference of 70 GeV is imposed on the magnitude of the vectorial difference between p| 11iss and 
the negative vector sum of the pj of all leptons and jets. Finally, events with possible contribu¬ 
tions from beam halo processes or anomalous noise in the calorimeter or tracking systems are 
rejected |53|. 

5 Search strategy 

The Mi 2 -based search strategy is outlined in this section. For both the inclusive-Mx 2 and the 
Mx 2 -Higgs searches, all selected jets are clustered into two pseudojets as described in Section[2] 
Several mutually exclusive signal regions are defined to optimize the search for a wide variety 
of new-physics models. The definition of signal regions is based on the event topology and 
event kinematic variables. The more general inclusive-Mx 2 search is described first. 

The inclusive-Mx 2 and Mx 2 -Higgs searches are not mutually exclusive. All but 4% of the events 
selected by the Mx 2 -Higgs search are also selected by the inclusive-Mx 2 search. 



5.1 lnclusive-M T2 search 
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5.1 Inclusive-Mx 2 search 

For the inclusive-Mx 2 search, nine regions, called topological regions, are defined by Nj and Nb, 
the numbers of jets and b-tagged jets in the event with pj > 40 GeV, as illustrated in Fig. [I] (left). 
These regions are chosen after testing the sensitivity of the search to various SUSY SMS models 
using simulated data. The regions with Nb = 0 are the most sensitive to the production of 
gluinos that do not decay to top and bottom quarks, and to the production of squarks of the 
first two generations. The regions with Nb > 0 and low (high) values of Nj are designed for 
bottom- and top-squark production with decays to bottom (top) quarks. Finally, the signal 
regions with Nj > 3 and Nb > 3 provide extra sensitivity to final states with multiple bottom 
or top quarks, for example from gluino pair-production. Since the values of Mj 2 and Hj in 
a SUSY event depend strongly on the mass of the initially produced SUSY particles, a wide 
range of values in Mj 2 and Hj is considered. Each of the nine topological regions is divided 
into three sub-regions of Hj, as shown in Fig. [l] (right): the low-Hj region 450 < Hj < 750 GeV, 
the medium-Hj region 750 < Hj < 1200 GeV, and the high-Hj region Hj > 1200 GeV. 
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Figure 1: Definition of the topological signal regions in terms of the number of jets Nj and the 
number of b-tagged jets Nb (left), and their subsequent division in terms of Hj and E™ ISS (right). 
The pie charts illustrate the expected contributions from different SM processes in the different 
signal regions; they are similar in all three Hj regions. 


Each of these regions is examined in bins of Mj 2 , where the number of bins (up to nine) de¬ 
pends on the specific topological and Hj selection. By design, the lowest bin in Mj 2 is chosen 
such that the multijet background is expected to be less than ~1-10% of the total background. 
The minimum threshold on Mj 2 varies between 100 and 200 GeV, depending on the topolog¬ 
ical region and the Hj requirement. The edges of the Mj 2 bins are adjusted to ensure that 
there are a sufficient number of events in each bin of the corresponding control samples for the 
background evaluation (Section J6j. The definitions of all signal regions are specified in Table [l] 

Figure [2] shows the Mj 2 distributions in simulation and data for the low-, medium-, and high- 
Hj selections, inclusively in all signal regions of the Nj-Nb plane. For Mj 2 < 80 GeV the dis¬ 
tribution in the medium- and high-Hx regions is completely dominated by multijet events. For 
this reason, these bins are used only as control regions. 

In the signal regions with Nj = 2 or Nb = 0, the dominant background is from Z (vi7)+jets 
production. The next-most important background is from W(/T')+jets events, while the back¬ 
ground from tt +jets events is small. In the regions with Nb = 1 all three processes (Z (I'Tj+jets, 
W(U/)+jets, and tt -t-jets production) are important. For all regions requiring multiple b-tagged 

















































6 


5 Search strategy 


Table 1: Definition of the signal regions used in the inclusive-Mx 2 search. 



Low-Hx region 

Medium-Hx region 

High-Hx region 


Mx 2 bin [GeV] 

Mj 2 bin [GeV] 

Mj 2 bin [GeV] 

II II 

0 u 

200-240 

350-420 

570-650 

125-150 

220-270 

425-580 

120-150 

260-350 

240-290 

290-350 

420-490 

490-570 

>650 

150-180 

180-220 

270-325 

325^25 

580-780 

>780 

150-200 

200-260 

350-550 

>550 

Nj = 2 , 

200-250 

310-380 

450-550 

100-135 

170-260 

>450 

100-180 


N b >l 

250-310 

380-450 

>550 

135-170 

260^50 


>180 



200-240 

420-490 


160-185 

300-370 

>800 

160-185 

350-450 

Nj = 3-5, 

240-290 

490-570 


185-215 

370-480 


185-220 

450-650 

N b = 0 

290-350 

570-650 


215-250 

480-640 


220-270 

>650 


350-420 

>650 


250-300 

640-800 


270-350 


Nj = 3-5, 

200-250 

310-380 

460-550 

150-175 

210-270 

380-600 

150-180 

230-350 

N b = l 

250-310 

380-460 

>550 

175-210 

270-380 

>600 

180-230 

>350 

Nj = 3-5, 

200-250 

325-425 


130-160 

200-270 

>370 

130-200 


N b = 2 

250-325 

>425 


160-200 

270-370 


>200 


Nj > 6 , 

200-280 

>380 


160-200 

250-325 

>425 

160-200 

>300 

N b = 0 

280-380 



200-250 

325^25 


200-300 


Nj > 6 , 

200-250 

>325 


150-190 

250-350 


150-200 

>300 

N b = l 

250-325 



190-250 

>350 


200-300 


Nj > 6 , 

200-250 

>300 


130-170 

220-300 


130-200 


N b = 2 

250-300 



170-220 

>300 


>200 


Nj >3, 

N b >3 

200-280 

>280 


125-175 

175-275 

>275 

>125 


jets, tt +jets events are the dominant source of background. The tt +jets contribution to the total 
background typically increases with the jet multiplicity and is important for all selections with 
Nj > 6 , regardless of the N j, selection. The relative contribution of tt +jets production decreases 
with increasing Mj 2 because of the natural cutoff of Mj 2 above the top-quark mass for these 
events. 

Contributions from other backgrounds, such as 7 +jets, Z (£ + £~)+ jets, and diboson production, 
are found to be negligible. 

5.2 M T 2 -Higgs search 

The Mj 2 -Higgs search is designed to select events with a light h boson produced in a cascade 
of supersymmetric particles initiated through the strong pair production of squarks or gluinos. 
As the dominant decay mode of the h boson in many SUSY models is h —» bb, a signature of 
a SUSY signal would be an excess in the invariant mass distribution of the selected b-tagged 
jet pairs, Mbb- An excess could help identify a preferred new-physics model, as the associated 
new particles would couple to the Higgs sector. Such an identification is not possible with the 
inclusive-Mi 2 search. 

Within a cascade of SUSY particles, the h boson is produced together with the LSP in the decays 
of neutralinos, such as X 2 Xj + h. As the neutralino x '2 can be a typical decay product of 
squarks and gluinos, the cross section for this kind of processes is among the largest in a large 
part of the SUSY parameter space. The final state contains at least two b-tagged jets, multiple 
hard jets, and a large value of Mx 2 - 

For the Mx 2 -Higgs search, b-tagged jets are required to have pj > 20 GeV. The event selection 



5.2 M T2 -Higgs search 


7 


19.5 ft) 1 (8 TeV) 



200 300 400 500 600 700 800 

M T2 [GeV] 


> 

CD 

(3 

LO 


C/) 

H— 1 

c 

CD 

> 

LU 



19.5 fb 1 (8 TeV) 


I 1 1 1 1 I 

| Multijet 
|W+jets 
|Z+jets 
[Top quark^ 
lOther 
Data 


* 


~7 



100 200 300 400 500 600 700 800 


M T2 [GeV] 



10 0 100 200 300 400 500 600 700 800 

M T2 [GeV] 


Figure 2: Distribution of the Mj 2 variable after the low-Hj (top), medium-Hj (bottom left), and 
high-Hj (bottom right) event selections, respectively. The event yields are integrated in the (Nj, 
N b ) plane over all the topological signal regions, for both simulated and data samples. These 
plots serve as an illustration of the background composition of the Mj 2 distributions. 


requires at least two b-tagged jets, along with Nj > 4. The two b-tagged jets stemming from the 
h boson decay are generally expected to appear within the same pseudojet, as they originate 
from the same decay chain. Using b-tagged jets within the same pseudojet, a b-tagged jet pair 
is selected if it has AR(bi,b 2 ) < 1.5. If multiple pairs are found in one or both pseudojets, 
the pair with the smallest AR(bi,b 2 ) is chosen. If no pair is found within the same pseudo¬ 
jet, pairs with b-tagged jets in different pseudojets are considered. If none of the pairs has 
AEfb^b?) < 1.5, the event is rejected. For signal events containing b quarks from the h boson 
decay, the efficiency to find the correct pair of b-tagged jets is about 70%. 

Using the known h boson mass of 125 GeV ||54[ [5B1, 12 signal regions are defined as 15 GeV-wide 
bins in the 20 < Mbb < 200 GeV range. Each of these signal regions is further divided into two 
sub-regions as follows: a low-Hx selection requiring 450 < Hj < 750 GeV, Ej' ss > 200 GeV, 
and Mj 2 > 200 GeV; and a high-Hj selection requiring Hj > 750 GeV and Mj 2 > 125 GeV. 
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6 Background estimation 


The overall yields of the main SM backgrounds (tt +jets / W(/:i')+jets, and Z (i/I/)+jets) are esti¬ 
mated using the same methods as for the inclusive-Mi 2 analysis. The contribution of the SM 
production of the Higgs boson is negligible in the search regions of this analysis. The shapes 
of the Mbb distributions for signal and the various backgrounds are obtained from simulation. 
Since in simulation we observe no appreciable correlation between Mj 2 and Mbb i n either the 
signal or background sample, the shape of the Mbb distribution is obtained from large simu¬ 
lated samples with relaxed Mj 2 requirements. An uncertainty due to the looser Mj 2 selection 
is taken into account. Further uncertainties in the shapes are assessed by varying several mod¬ 
elling parameters of the simulation. 


6 Background estimation 

This section describes the procedures used to estimate the main backgrounds: multijet events, 
Z+jets events where the Z boson decays to neutrinos, and W+jets and tt +jets events where 
one W boson decays leptonically but the corresponding charged lepton lies outside the accep¬ 
tance of the analysis, is not reconstructed, or is not isolated. The same background estimation 
procedures are used for both the inclusive-Mj 2 and Mj 2 -Higgs searches. 

6.1 Determination of the multijet background 

The multijet background consists of direct multijet production, but also of events with tt pairs 
or vector bosons that decay hadronically. From Fig. [2j the multijet background is expected 
to be negligible at large values of Mt 2 - This background, arising from difficult-to-model jet 
energy mismeasurements, is nonetheless subject to considerable uncertainty. A method based 
on data control samples is used to predict this background. The method relies on Mj 2 and 
the variable Acp inin , described in Section |4] In general terms, the multijet background entering 
each of the signal regions, for which a selection requirement is Acp m \n > 0.3, is estimated from 
a corresponding control region defined by the same criteria as the signal regions except for 
A(p m in, which is required to be less than 0.2. The control regions are dominated by multijet 
event production. 

The transfer factor between control and signal regions, and our parameterization thereof, are 
given by 

r(M T2 ) = ^ ° n 3 \ = exp (a - b M T2 ) + c for M T2 > 50 GeV. (3) 

y/\(p m in < U.ZJ 

The parameters a and b are obtained from a fit to data in the region 50 < Mj 2 < 80 GeV, where 
the contributions of electroweak and top-quark (mainly tt +jets events) production are small. 
The constant term c is only measurable in control samples requiring high-Mi 2 values. For these 
events, however, the non-multijet contribution is dominant, and so c cannot be obtained from a 
fit to data. Therefore, the parameterization of r(Mj 2 ) is fixed to a constant for Mx 2 > 200 GeV. 
This constant is chosen as the value of the exponential fit to r(Mx 2 ) at Mx 2 = 200 GeV. 

The parameterization is validated by fitting r{Mjj) to a sample of simulated multijet events, 
and multiplying this ratio by the number of events found in data with Acp m i n < 0.2. The result 
is compared to the number of events in data with Acp m i n > 0.3, after subtraction of the non¬ 
multijet contribution using simulation. An example is shown in Fig. [3j The prediction is seen 
to provide a conservative estimate of the expected multijet background. The robustness of the 
method is further validated by varying the range of Mx 2 in which the exponential term is fitted, 
and by changing the Acpmin requirement used to define the control regions. 





6.2 Determination of the W(H)+jets and leptonic top-quark background 


9 


19.5 fb" 1 (8 TeV) 



Figure 3: The ratio r(Mj 2 ), described in the text, as a function of Mx 2 for events satisfying the 
medium-Hx and the (Nj = 3-5,Nb = 0) requirements of the inclusive-Mi 2 search. The solid 
circle points correspond to simple data yields, while the points with open circles correspond 
to data after the subtraction of the non-multijet backgrounds, as estimated from simulation. 
Two different functions, whose exponential components are fitted to the data in the region 
50 < Mj 2 < 80 GeV, are shown. The green dashed line presents an exponential function, while 
the blue solid line is the parameterization used in the estimation method. 

In the low-Hx regions, the E™ ISS requirement of the triggers distorts r(Mx 2 ) for low values of 
Mx 2 - Therefore, the data selected by the standard triggers cannot be used to obtain r(Mx 2 )- 
Other triggers, based on Hj only, are used instead. These triggers accept only a small fraction 
of the events that satisfy the trigger criteria ("prescaled"), allowing access to the low-Hx region 
without a Ey ISS requirement. 

The dominant sources of uncertainty for this method include the statistical uncertainty of the 
fit, the stability of the fit under variations of the fit conditions, the statistical uncertainty of the 
control region with A</) min < 0.2 used for the extrapolation, and a 50% uncertainty assigned to 
the choice of the Mx 2 value used to define the constant term in the functional form of r(Mj 2 ). 
In signal regions with low Mx 2 , where the exponential term of Eq. (J3| dominates the constant 
term, this method provides a relatively accurate estimate of the background, with uncertainties 
as small as 10% that increase to around 50% for signal regions with less statistical precision. 
For signal regions with large Mx 2 , the constant term dominates and the uncertainty increases 
to 50-100%. Note that at large Mj 2 , the estimate of the multijet background provided by this 
method, while conservative, is nonetheless negligible compared to the contributions of the 
other backgrounds. 

6.2 Determination of the \N(lv) +jets and leptonic top-quark background 

The background from W(H/)+jets and top-quark production (mainly tt +jets events, but also 
single top-quark production) stems from events with a leptonically decaying W boson in which 
the charged lepton either lies outside the detector acceptance, or lies within the acceptance 
but fails to satisfy the lepton reconstruction, identification, or isolation criteria. Since these 
events arise from a lepton (e, }i, or r lepton) that is not found, we call them "lost-lepton" 
events. For both sources of lost leptons, the contribution from r leptons is slightly higher 
than from electrons or muons since the reconstruction efficiency for r leptons is smaller and 
the acceptance criteria are more stringent than for the other two types of leptons. According 
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Figure 4: Distribution of the Mj 2 variable for events with one electron (left), one muon (mid¬ 
dle), or one r lepton (right) in data and simulation. The events satisfy either the low -Hj se¬ 
lection (top) or either of the medium- and high-H T selections (bottom). They also satisfy the 
remaining inclusive-Mx 2 selection requirements, with the exception of the lepton veto. Finally, 
the condition Mj < 100 GeV is imposed on the charged lepton-E™ lss system. 


to simulation, around 40% of this background can be attributed to events containing a lost 
r lepton. The contribution of electron and muon events is of equal size. 

For each signal region, the lost-lepton background is estimated in a corresponding data control 
sample for which the full event selection is applied, with the exception of the lepton veto, i.e. 
exactly one charged lepton (e, }i, or r lepton) is required instead of zero. To reduce the potential 
contribution of signal events to the control samples, the transverse mass of the lepton-E™ ISS 
system is required to satisfy Mj < 100 GeV. The Mj 2 distributions of events satisfying the 
selection as outlined in Section |4} but after requiring one reconstructed and identified lepton, 
are shown in Fig.[4]for both data and simulation. 

After subtracting the number of events expected due to the misidentification of hadrons as lep¬ 
tons and due to leptons from hadron decays, the numbers of events in the one-lepton control 
samples are scaled by a lost-lepton factor = [1 — e{i )\/ [e(€)e(Mx)], where e(£) is the com¬ 
bined lepton efficiency and acceptance, and e(Mj) is the efficiency of the Mj selection. This 
factor Rpi is therefore the transfer factor from the control region to the signal region, obtained 
in simulation. 

For large values of Mt 2 / we expect very few events with a single reconstructed charged lepton. 
Therefore, the estimation of the lost-lepton background is performed in data for all topological 
regions in (Nj, Nb) and for the different Hj selections, but integrating over all Mj 2 bins. The 
factor lift is recalculated for each topological signal region and for the different selections in Hj. 
The estimated number of background events is divided among the different Mj 2 bins using the 
shape of the Mj 2 distribution as predicted by simulation. 
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The systematic uncertainty in the integrated lost-lepton background estimate includes the un¬ 
certainties in the lepton efficiencies, acceptance, and the subtraction of the lepton events asso¬ 
ciated with misidentification and hadron decays. These uncertainties are obtained by studying 
the differences between data and simulation using so-called tag-and-probe H56l and tight-to- 
loose 11571 methods. These uncertainties amount to about 10-20%. Including the statistical 
uncertainties from the data control regions, the total uncertainty of the lost-lepton background 
ranges from 10 to 65%. The uncertainty in the shape of the Mj 2 distribution is estimated by 
varying parameters in the simulation. The most important of these uncertainties are the recoil 
modelling Il52ll (20%), the matching scale, the renormalization and factorization scales (10-20%), 
and the jet energy scale 11441 (10%). The numbers in parentheses correspond to maximal varia¬ 
tions in the Mj 2 shape, but the overall normalization is not affected since it is predicted using 
the aforementioned method. The differences in shape between the distributions in data and 
simulation, shown in Fig. |4j lie within these uncertainties. 


The effect of signal contributions to the lost-lepton control samples can be significant and is 
taken into account before the interpretations presented in Sections [9] and 10 are performed. 
Specifically, the predicted yield in the signal regions is corrected by subtracting the additional 
signal contribution caused by the possible presence of the signal in the lost-lepton control sam¬ 
ple. 


6.3 Determination of the Z(nt/)+jets background 

The Z (vi')+jets background is estimated by selecting a control sample of 7 +jets events and 
then subtracting the photon momentum in the computation of all the relevant event quantities, 
such as Mj 2 , in order to replicate the decay of a Z boson into undetected neutrinos. After 
the subtraction of the photon momentum, the pj and the Mj 2 variables are recalculated and 
the event selections corresponding to the different signal regions are applied. The number of 
selected events, which is rescaled as described below, provides the background estimate for the 
Z (i/T)+jets process. 

As discussed in Ref. f58|, the Z+jets and 7 +jets processes differ because of the different elec¬ 
tro weak couplings and the non-zero Z-boson mass niz■ For vector boson pj fflz, however, 
the ratio of cross sections for prompt-photon to Z-boson production is determined by the ratio 
of the couplings of the respective boson to quarks, and thus approaches a constant value. In 
this range of the boson pj, the distributions of Hj and other kinematic observables are very 
similar for the Z+jets and 7 +jets processes. The 7 +jets process, with its relatively large event 
yield, is thus well suited to provide an estimate of the Z (vv)+jets background. 

Figure [5] shows a comparison between data and simulation for the Mj 2 distribution in 7 +jets 
control samples, for which N b =0 is required. The photon pj is added to the p™ iss vector and 
all event variables are recalculated. To reduce the potential contribution of signal events to 
these control samples, we require the reconstructed Ey iss to be less than 100GeV prior to in¬ 
cluding the reconstructed photon momentum. For the low-Hj signal regions, the 7 +jets events 
are selected with a single-photon trigger, which requires the photon pj to exceed 150 GeV. The 
single-photon trigger is used because the triggers discussed in Section [3] are unable to select 
events with low enough E™ ISS . For the medium- and high-Hj signal regions, the triggers dis¬ 
cussed in Section |3] are used. 

The selected photon control samples contain both genuine prompt-photon events and events 
with collinear pairs of photons that stem from neutral-meson decays within jets and are recon¬ 
structed as single photons. The prompt-photon fraction in the control samples is obtained by 
means of a maximum likelihood fit of templates from simulation to a photon shower shape 
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M T2 [GeV] M T2 [GeV] 

Figure 5: Distribution of the Mji variable for data and simulation after requiring the presence 
of one photon, Nb = 0 , and the remainder of the inclusive-Mx 2 selection criteria. Events sat¬ 
isfying the low-Hj selection (left), and the medium- and high-Hy selections (right) are shown. 
For these results, Mj 2 is calculated after adding the photon pj to the E™ ISS vector. 

variable in data. The shower shape variable that we use is which is a measure of the lateral 
extent in // of the photon energy cluster in the calorimeter |5l1 . The fit is performed sepa¬ 
rately in the electromagnetic calorimeter barrel and endcap detectors, for events with N\, = 0 
and with no requirement on Mj 2 - This sample of events is dominated by low-px photons, 
for which the shower shape variable provides high discrimination between prompt photons 
and photons from neutral-meson decays. Starting from the overall prompt-photon fraction ob¬ 
served in data, we use simulation to extrapolate the contributions of the two types of photon 
events in Mx 2 - For each signal region with Nb = 0, the final Z (i/F)+jets background estimate 
is obtained from the number of prompt-photon events, rescaled by the Mx 2 -dependent ratio of 
Z (i/i7)-i-jets to 7 +jets events from simulation. The Z(vv)/ 7 ratio increases as a function of the 
photon pj and reaches a constant value above 350 GeV, as shown in Fig. [ 6 ] 

The accuracy of the Z-boson pj distribution in simulation is validated using a control sample 
of dileptonic Z-boson events, i.e. Z —> e + e or u 1 f/, selected with dilepton triggers. Here, 
analogously to the photon control sample, the dilepton momentum is subtracted in the com¬ 
putation of all relevant event quantities, such as Mx 2 , in order to model the Z —> vv decay. 
From the data-to-simulation comparison of the Z(E + E ~)/7 ratio as a function of the search 
variables, a systematic uncertainty of 20% is assigned to the Z(vv)/y ratio. For the signal re¬ 
gion bins corresponding to Mx 2 > 350 GeV, this uncertainty increases to 30% because of large 
statistical uncertainty in the ratio for events with large Mx 2 - Compared to these uncertainties, 
the normalization uncertainty associated with the shower shape fit is negligible. 

The Z(yv) / 7 ratio may not be well modelled in simulation for Nb > 1 as the coupling of Z 
bosons and photons differs for b quarks. If the b-quark content in simulation is mismodelled 
(for example the modelling of gluon splitting g —> bb), the Z(vv) / 7 ratio might be biased in 
b-quark enriched events. Another biasing effect might be the treatment of the b-quark mass 
in simulation, which affects the coupling of b quarks to Z bosons and photons. Therefore, the 
previous procedure is only applied in signal regions with Nb = 0. For the Nb = 1 case, the 
results obtained from the Nb = 0 control samples are scaled by Z^(lb)/Z^(0b), the ratio of 
the numbers of events containing dileptonic decays of the Z boson and Nb = 1 or Nb = 0, 
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Figure 6: Ratio Z(vv)/y of events satisfying the event selection of the (Nj > 2, Nb = 0) signal 
region as a function of the boson pj. The events are summed inclusively in all Hj sub-regions 
with Hj > 450 GeV. The ratio is obtained in simulated events after the photon momentum is 
included in the E™ lss calculation. 

respectively This ratio is obtained using data from the dilepton control sample for different 
values of Nj. As the ratio is found to depend neither on Mj 2 nor Hj, its value is measured 
without any requirement on these two variables, in order to increase the statistical precision of 
the control samples. 

Uncertainties in the Z^(lb)/Z^(0b) ratio are evaluated by varying the kinematic selections to 
test the stability of the ratio. The resulting uncertainties are mostly determined by the statistical 
limitations of the control samples. The size of the uncertainty is 10-30% for the regions with 
Nj < 5, while it is 50-75% for regions with Nj > 6. 

For the signal regions with Nb > 2, the Z (vv)+jets background is estimated from simulation 
and is assigned an uncertainty of 100%. We verified that using an uncertainty twice as large, or 
twice as small, has a negligible impact on the final results. The explanation for this is that for 
Nb > 2, the Z (t/i/)+jets background is very small compared to the tt +jets background. 


7 Results 

This section reports the number of events observed in the signal regions. The yields are com¬ 
pared with the estimated number of background events as predicted by the methods described 
in Section [U 

7.1 Results for the inclusive-Mr 2 analysis 

For the inclusive-Mx 2 search, the final event yields in all signal regions are shown in Figs. [7j-|9] 
The comparison between observed and predicted yields is shown separately for different topo¬ 
logical regions and for the different Hj selections. The total uncertainty of the background 
estimates is the quadratic sum of the statistical and systematic uncertainties from the three 
categories of background. The results are tabulated in Table [2] The shape uncertainty in the 
estimation of the lost-lepton background is not included either for the figures or table. 
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Figure 7: Distributions of the Mjj variable for the estimated background processes and for 
data. Plots are shown for events satisfying the lo w-Hj (left), the medium-Hj (middle), and the 
high-Hj (right) selections, and for different topological signal regions (Nj, Nb) of the inclusive- 
Mt 2 event selection. From top to bottom, these are (Nj = 2, Nb = 0), (Nj = 2, Nb > 1), and 
(3 < Nj < 5, Nb = 0). The uncertainties in each plot are drawn as the shaded band and do not 
include the uncertainty in the shape of the lost-lepton background. 
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Figure 8: Distributions of the Mj 2 variable for the estimated background processes and for 
data. Plots are shown for events satisfying the low -Hj (left), the medium-Hj (middle), and the 
high-Hj (right) selections, and for different topological signal regions (Nj, Nb) of the inclusive- 
Mj 2 event selection. From top to bottom, these are (3 < Nj < 5, Nb = 1), (3 < Nj < 5, Nb = 2), 
(Nj > 6, Nb = 0). The uncertainties in each plot are drawn as the shaded band and do not 
include the uncertainty in the shape of the lost-lepton background. 
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Figure 9: Distributions of the Mjj variable for the estimated background processes and for 
data. Plots are shown for events satisfying the lo w-Hj (left), the medium-Hj (middle), and the 
high-Hj (right) selections, and for different topological signal regions (Nj, Nb) of the inclusive- 
Mt 2 event selection. From top to bottom, these are (Nj > 6, Nb = 1), (Nj > 6, Nb = 2), 
(Nj > 3, Nb > 3). The uncertainties in each plot are drawn as the shaded band and do not 
include the uncertainty in the shape of the lost-lepton background. 
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Table 2: Event yields, for estimated background and data, in the signal regions of the inclusive- 
Mj 2 search. The uncertainties are the quadratic sum of statistical and systematic uncertainties. 


Signal 

Low-H t region 


Medium-Hj region 


High-Hy region 


region 

M T2 [GeV] 

Prediction 

Data 

My 2 [GeV] Prediction Data 

M T2 [GeV] 

Prediction 

Data 


200-240 

553±70 

588 

125-150 

167±21 

171 

120-150 

21.9T4.9 

18 


240-290 

395±53 

451 

150-180 

128T17 

104 

150-200 

19.4T4.3 

18 


290-350 

288±40 

318 

180-220 

85.8T11.3 

91 

200-260 

14.5T3.4 

10 

7 \T o 

350-420 

236±52 

232 

220-270 

70.0T10.3 

78 

260-350 

6.3T1.8 

9 

i\/j — Z, 

420-490 

165±36 

162 

270-325 

38.1T5.8 

48 

350-550 

4.3T1.6 

8 

N b =0 

490-570 

68.9T15.5 

61 

325-425 

43.4T10.1 

45 

>550 

3.0T1.4 

6 


570-650 

17.3T4.3 

19 

425-580 

21.3T4.7 

29 





>650 

4.1T1.6 

1 

580-780 

20.8T5.6 

10 








>780 

3.5T1.4 

2 





200-250 

56.4T12.8 

56 

100-135 

27.4T9.6 

30 

100-180 

11.4T8.1 

2 


250-310 

34.2T8.1 

44 

135-170 

21.1T7.5 

19 

>180 

4.4T2.6 

2 

Nj =2, 

310-380 

25.9T7.4 

29 

170-260 

13.4T5.4 

15 




Nb >1 

380-450 

19.9T5.8 

13 

260-450 

7.3T3.5 

7 





450-550 

12.6T3.8 

15 

>450 

3.4T1.7 

9 





>550 

2.6T0.8 

3 
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1041 
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243±23 
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34.9T4.7 

39 


240-290 
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827 

185-215 

180±19 

203 
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31.1T4.7 

32 


290-350 

492±65 

522 

215-250 

134±16 

152 

220-270 

25.5T4.3 

25 

AT _ O C 

350-420 

280T57 

333 

250-300 

112±14 

119 

270-350 

19.3T3.5 

19 

J D, 

420-490 

138±29 

145 

300-370 

89.0T12.2 

91 

350-450 

9.1T2.5 

6 

N b =0 

490-570 

60.0T13.6 

66 

370-480 

67.0T14.2 

75 

450-650 

5.0T1.6 

5 


570-650 

13.8T3.9 

21 

480-640 

35.0T8.0 

40 

>650 

4.4T1.6 

5 


>650 

3.6T1.5 

2 

640-800 

10.0T2.7 

16 








>800 

3.4T1.5 

4 





200-250 

305±34 

300 

150-175 

93.4T10.7 

87 

150-180 

13.5T3.1 

28 


250-310 

167±21 

172 

175-210 

69.5T8.7 

71 

180-230 

8.7T2.2 

7 

Nj = 3-5, 

310-380 

103T16 

98 

210-270 

52.8T6.8 

63 

230-350 

6.2T1.6 

9 

N b = l 

380-460 

43.6T8.7 

47 
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47 

>350 

3.5T1.0 
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19 
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>600 
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4 
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91.1T22.0 

97 
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42.4T7.5 

53 
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6.8T2.3 
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AT _ O C 
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39 
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29 
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— O D, 
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16 
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11 
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11 








>370 

2.9T1.1 

5 
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50.8T8.9 

56 

160-200 

38.5T6.2 

44 

160-200 

12.1T2.9 

12 
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14.7T3.1 

16 

200-250 

19.3T3.6 

34 

200-300 

10.1T3.2 

7 

IN/j > b. 

>380 

7.3T2.3 

8 

250-325 

14.1T2.8 

23 

>300 

4.5T1.7 

2 

N b =0 




325-425 

5.8T1.9 

9 








>425 

2.3T0.8 

4 





200-250 

32.0T6.7 

31 
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38.7T5.9 

38 
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7.3T3.2 

6 

Nj >6, 
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14.7T3.1 

23 
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21.1T3.5 

21 
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5.1T2.4 

5 
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>325 

4.8T1.5 

11 
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10.5T1.9 

13 
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3.0T0.8 
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15 
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41.0T7.0 

54 
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10.6T6.0 

10 

Nj >6, 
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4.6T1.6 

13 

170-220 

19.4T3.8 

28 

>200 

4.7T2.9 

2 

N b =2 

>300 

2.8T1.0 

6 

220-300 

10.4T2.1 

8 








>300 

4.3T0.8 

6 





200-280 

16.1T6.2 

16 

125-175 

31.9T11.4 

17 

>125 

4.5T2.1 

3 
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4.6T1.7 

7 
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7 Results 


The level of compatibility between the data and the SM predictions is assessed by computing 
the pull value for all signal regions, where the pull value is defined for each signal region bin 
as: 


Pull = 


hiobs hlbkg 



(4) 


where N 0 b s is the observed number of events, <7 0 b s is its statistical uncertainty, and Nbk g is the 
background estimate with a total uncertainty of <7bk g . After the average pull over all the signal 
regions is calculated, pseudo-experiments are used to evaluate the probability to observe an 
average at least as large as the average observed in data. The probability is found to be 11%. 
Thus, the data are found to be in agreement with the SM predictions within the uncertainties. 


In order to present the results in a compact manner, the yields of all Mj 2 bins that belong to 
the same topological region and that satisfy the same Hj selection are summed. The resulting 


sums are presented in Fig. 10 


19.5 fb' 1 (8 TeV) 



Figure 10: Event yields, for both estimated backgrounds and data, for the three Hj selections 
and all the topological signal regions of the inclusive-Mi 2 search. The uncertainties are drawn 
as the shaded band and do not include the uncertainty in the shape of the lost-lepton back¬ 
ground. 


7.2 Results for the M T 2 Higgs analysis 

For the Mi 2 -Higgs analysis, the observed numbers of events in data and the predicted back¬ 
ground yields are summarized in Table[3]for the two different selections in Hj. The background 
predictions and the data yields are shown for the different Mbb bins in Fig. 11 along with the 
distribution of events for a possible SUSY scenario. This scenario is based on gluino pair pro¬ 
duction in which one of the gluinos produces one h boson in its decay chain. More details 


about this signal scenario are provided in Section 10.1 




































































19 


Table 3: Event yields for the W(H/)+jets and tt +jets processes (i.e. the lost-lepton background), 
the Z (i/i7)+jets background, and data. Yields are shown for both the low- and the high-Hx 
selections of the Mj 2 -Higgs search. The lost-lepton background is estimated from data control 
samples, while the Z (i'v)+jets is evaluated using simulation. 


> ™ 
CD 

0 18 


Channel 

Tost lepton 

Z (i/i/)+jets 

Total background 

Data 

Fow-Hx 

37.1 ±9.0 

6.9 ±6.9 

44.0 ±11.3 

55 

High-Hx 

64.8 ± 16.4 

4.4 ± 4.4 

69.2 ± 17.0 

81 
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Figure 11: Distributions of the Mbb variable for the W(£i/)+jets and tt +jets processes (i.e. the 
lost-lepton background), the Z (vT)+jets background, data, and a possible SUSY signal. The 
distributions are shown for both the low- (left) and the high-Hx (right) selections of the Mx 2 ~ 
Higgs search. The lost-lepton background is estimated from data control samples, while the Z 
(i/i/)+jets is evaluated using simulation. The uncertainties in each plot are drawn as the shaded 
band and do not include the uncertainty in the shape of the lost-lepton background. The signal 
model consists of gluino pair production events with one of the two gluinos containing an 
h boson in its decay chain. For this model it is assumed ms = 750 GeV and rn-o = 350 GeV. 


8 Systematic uncertainties 


A summary of the range-of-effect for each source of uncertainty relevant for the background 
prediction or signal efficiency is presented in Table [4j While the systematic uncertainties in 
the background predictions have already been discussed in Section [6] the dominant sources of 
systematic uncertainties in the selection efficiencies of signal events are described here. 

The corrections for the differences observed between the signal simulation and data due to 
the jet energy scale and b-tagging efficiencies yield uncertainties in the signal yield of around 
5%, but these uncertainties can become as large as 40% in kinematically extreme regions. The 
uncertainty associated with the corresponding correction to account for the px spectrum of 
the recoil system reaches a maximum of 20% for px > 250 GeV. The systematic uncertainty 
associated with the parton distribution functions is evaluated following the prescription of the 
PDF4FHC group 159H631 , and is found to have an effect of about 5%, increasing to a maximum 
of 15% for small splittings between the parent particle mass and the ESP mass. Additionally, 
uncertainties associated with the luminosity determination [1641 and the trigger efficiency are 
included. 







































20 


9 Statistical interpretation of the results 


Table 4: Summary of the different systematic uncertainties of the SM background predictions 
and of the signal efficiency A given source of uncertainty can contribute differently depending 
on the search region, and the typical ranges of effect are shown. Sources of uncertainty that 
change the shape of the Mt 2 distributions in the inclusive-Mi 2 analysis or the shape of the Mbb 
distributions in the Mx 2 -Higgs search are marked with a cross in the last column. 


Process 

Source/Region 

Effect 

Shape 

Multijet 

Mx 2 < 200 GeV 

10-50% 

— 

Mx 2 ^ 200 GeV 

50-100% 

— 


Lost-lepton method (sys ® stat) 

10-65% 

— 

W(/:i')+jets and Top 

b-tagging scale factor 

Jet energy scale 

Matching scale 

Renormalization and factorization scales 

— 

X 

X 

X 

X 


System recoil modelling 

— 

X 

Z (i/i7)+jets 

Systematics on Z(vv)/'y ratio (Nb = 0-1) 
Systematics on lb/Ob ratio from (Nb = 1) 

Statistics from y+jets data (Nb = 0-1) 

20-30% 

10-75% 

5-100% 

— 


Simulation (Nb > 2) 

100 % 

— 

Signal 

Integrated luminosity 

Trigger efficiency 

Parton distribution functions 

2 .6% 

1 % 

5-15% 

— 

b-tagging scale factor 

Jet energy scale 

System recoil modelling 

5-40% 

5-40% 

10 -20% 

X 

X 

X 


9 Statistical interpretation of the results 


This section describes the statistical procedure used to interpret the observed event yields 
in order to set upper limits on the cross sections of potential signal processes. A test of the 
background-only and signal+background hypotheses is performed using a modified frequen- 
tist approach, often referred to as CL S H65i 

Signal regions are combined through a joint likelihood function. This function is constructed as 
the product of Poisson probabilities for each bin of Nj, N't,, Hj, and Mx 2 - The Poisson probabil¬ 
ities are functions of the number of observed events in each bin, n,, and the predictions in each 
bin. A,, where i ranges from 1 to the number of bins, Nbm S - The likelihood function is given by 


Anns A 

£=n ■ 

i=i 


Kf! 


(5) 


The prediction in each bin is a sum over the signal and background contributions: 

N>kg 

A i = iiSi+ Yjbij, (6) 

7=1 


where bjj is the background prediction in bin i for background source /, and s, is the signal 
prediction in bin z, scaled by the signal-strength modifier ji to test other values of the signal 
production cross section, a = /z< 7 sig , with u sig the nominal cross section for the signal model 
under consideration. 


The uncertainties are handled by introducing nuisance parameters 9. The signal and back¬ 
ground expectations, therefore, become dependent on N sys nuisance parameters 9 m , where 
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m = 1... N sys , i.e. s = s(9 m ) and b = b(6 m ). All sources of uncertainties are taken to be either 
100%-correlated (positively or negatively) or uncorrelated (independent), whichever is found 
to be appropriate. Incorporating the nuisance parameters, the likelihood function becomes: 

C(data\p, 9) = Poisson(data|f/s(0) + b(6))p(9 ), (7) 

where p(9) is the probability density function associated with the given systematic uncertainty 
In this equation, C(data\p, 9) is the likelihood function for data for a given value of p and 6. 


In order to test the compatibility of the data with the background-only and signal+background 
hypotheses, a test statistic q } , l66l is constructed starting from the profile-likelihood ratio: 


q t i = -2 In 


C (data \p, 9 }l ) 
C (data|/l,0) , 


with 0 < p < p, 


( 8 ) 


where "data" can be the actual data or the output of a pseudo-experiment. Both the denomina¬ 
tor and numerator are maximized. In the numerator, the signal parameter strength p remains 
fixed and the likelihood is maximized for only the nuisance parameters, whose values after the 
maximization are denoted (9 fi . In the denominator, the likelihood is maximized with respect to 
both p and 6, and p and 9 are the values for which C is maximal. The lower constraint 0 < fi is 
imposed as the signal strength cannot be negative, while the upper constraint guarantees a one¬ 
sided confidence interval (this means that upward fluctuations of data are not considered as 
evidence against the signal hypothesis). The value of the test statistic for the actual observation 
is denoted as £j° bs . This test statistic was chosen by the LHC Higgs Combination Group |67|| . 

To set limits, probabilities to observe an outcome at least as signal-like as the one observed are 
calculated for the null (background-only) hypothesis Hq and for the test (signal+background) 
hypothesis Hi, for a given value of the signal-strength modifier p, as: 

CL s+h (p)=P(q }l >qf s \H 1 ) l 

CL h (p)=P(q }l >qf s \H 0 ). 

The CL S quantity is then defined as the ratio of these probabilities: 


CL s (p) 


CL s+ b(/<) 
CLbOO • 


( 10 ) 


In the modified frequentist approach, the value of CL s (p) is required to be less than or equal 
to dc in order to declare a (1 — a) CL exclusion. We set 95% CL limits on the signal cross section 
by finding the value of p for which CL S (^) = 0.05. 


In practice, the probability distributions of the background-only and the signal+background 
hypotheses are determined from distributions of the test statistic constructed from pseudo¬ 
experiments. Once the ensembles of pseudo-experiments for the two hypotheses are generated, 
the observed CL S limit is calculated from these distributions and the actual observation of the 
test statistic £j° bs . The expected CL S limit is calculated by replacing (j° bs by the expected median 
from the distribution of the background-only hypothesis. Further details on the procedure 
employed to compute the limits on the signal production cross section are given in Ref. H67l . 


10 Exclusion limits 

The 95% CL upper limits on signal production cross sections are computed following the CL S 
formulation described in Section [9} using the results presented in Section [7] and the systematic 
uncertainties summarized in Section [8l 
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10 Exclusion limits 


10.1 Exclusion limits on simplified models 

In this section, we interpret the results of our search in terms of simplified models Ii26l , which 
allow the exclusion potential of the data to be examined in the context of a large variety of 
models. 


The following list describes the simplified models that are probed and the corresponding sub¬ 
sets of signal regions from the inclusive-MT 2 search that are used to set the limits: 

• direct pair production of squarks with q —> qy^. The topological regions that are 
used to probe this model are those defined by the selections (Nj = 2, N b = 0), 

(Nj = 2,Nb > 1), (3 < Nj < 5,Nb = 0), (3 < Nj < 5,N b = 1), and (Nj > 

6,N b = 0). Exclusions limits are shown in Fig. [l2] (upper left) for two scenarios: 
one assumes that the first two generations of squarks (ul, Ur, d[, dR, cy, Cr, Sl, Sr) 
are degenerate and light; the other requires that only one light-flavour squark be 
kinematically accessible. 

• direct pair production of bottom squarks with b —> by^. The signal regions that are 
used are those defined by (Nj = 2, N b > 1), (3 < Nj < 5, N b = 1), and (3 < Nj < 

5, N b = 2). The corresponding exclusion limits are shown in Fig. [T2] (upper right). 

• direct pair production of top squarks with t —> ty^. The topological regions used to 
probe this model are those defined by (3 < Nj < 5, N b = 1), (3 < Nj < 5, N b = 2), 

(Nj > 6,N b = 1), (Nj > 6,N b = 2), and (Nj > 3, N b > 3). The corresponding 
exclusion limits are shown in Fig. [l2| (bottom). 

• gluino pair production with g —> qqyj- The topological regions used to probe this 

model are those defined by (3 < Nj < 5, N b = 0), (3 < Nj < 5, N b = 1), (Nj > 

6, N b = 0), and (Nj > 6, N b = 1). The corresponding exclusion limits are shown in 
Fig. [13] (upper left). 

• gluino pair production, with g —» bby^. The topological regions used to probe this 

model are those defined by (3 < Nj < 5, N b = 1), (3 < Nj < 5, N b = 2), (Nj > 

6, Nb = 1), (Nj > 6, Nb = 2), and (Nj > 3, N b > 3). The corresponding exclusion 
limits are shown in Fig. [13] (upper right). 

• gluino pair production, with g —> tfy^. The topological regions used to probe this 
model are those defined by (Nj > 6, N b = 1), (Nj > 6, N b = 2), and (Nj > 3, N b > 3). 

The corresponding exclusion limits are shown in Fig. [13] (bottom). 

All exclusion limits are obtained at NFO + next-to-the-leading-logarithm (NLF) order in oc s . 


For the direct pair production of top squarks, the analysis is not sensitive to model points with 
m t — m-i) = m t , because the y 1 , 1 is produced at rest in the top-squark frame. 


For all the considered models, the observed limits are compatible within one standard devia¬ 
tion with the expected limits, with the exception of the limits on the direct pair production of 
top squarks with t —» ty”, which are shown in Fig. 12 (bottom). A comparison between the 


background estimates obtained directly from simulation and those calculated from the data 
control samples suggests that the weaker-than-expected limits are not caused by an excess in 
the signal region, but rather by a downward fluctuation in the lost-lepton control sample, lead¬ 
ing to a possible underestimate of the lost-lepton background. Considering the large number 
of data control samples (there are 81 lost-lepton control regions), the probability to observe a 
fluctuation as large as the one observed is ~65%. 


The results of the Mx 2 -Higgs search are used to probe the following model: gluino pair pro- 
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Figure 12: Exclusion limits at 95% CL for (upper left) direct squark production, (upper right) 
direct bottom-squark production, and (bottom) direct top-squark production. For the direct 
squark production, the upper set of curves corresponds to the scenario where the first two 
generations of squarks are degenerate and light, while the lower set corresponds to only one 
accessible light-flavour squark. For convenience, diagonal lines have been drawn correspond¬ 
ing to 7ii~ o = MZqg t anc ^ m x'( = m qbt ~ ( ? ” w + ?n b) where applicable. 
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10 Exclusion limits 
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Figure 13: Exclusion limits at 95% CL for gluino mediated (upper left) squark production, (up¬ 
per right) bottom-squark production, and (bottom) top-squark production. For convenience, 
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10.2 Exclusion limit in the cMSSM/mSUGRA model 
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duction with one gluino decaying via g —> X .2 IVC?/ and the other gluino decaying via 

g —* qq'^i , xf ~t W^y ( |. In this scenario, the neutralino X 2 ar >d chargino X\ are assumed to 
be degenerate, with mass m~ 0 = m~± = m^o + 200 GeV. The corresponding exclusion limits are 
shown in Fig. [14] 
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Figure 14: Exclusion limits at 95% CL for gluino pair production with one gluino decaying 
via g —> qqy^, X 2 while the other gluino decays via g —> qq'yj, XT ~> W^y ( |. For 

convenience, diagonal lines have been drawn corresponding to m ~0 = nig and m ~0 = mg — 
200 GeV. 


10.2 Exclusion limit in the cMSSM/mSUGRA model 
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Figure 15: Exclusion limits at 95% CL as a function of mo and m t / 2 for the cMSSM/mSUGRA 
model with tan/3 = 30, Ao = -2max(mo,mi/2), and }i > 0. Here, mq is the average mass of 
the first-generation squarks. 
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11 Summary 



Figure 16: Exclusion limits at 95% CL as a function of mg and mg for the cMSSM/mSUGRA 
model with tan/3 = 30, Aq = — 2max(mo,mi/2), and }i > 0. Here, mg is the average mass of 
the first-generation squarks. 


We also provide an interpretation of our results in terms of the cMSSM/mSUGRA model. The 
model has five free parameters: mo, my 2 , Aq, tan /3, and sign ft. In order to obtain an h boson 
mass of about 125 GeV, the value Aq = — 2max(mo, m 1 / 2 ) is chosen, as proposed in Ref. H68l . 
Furthermore, we choose }i > 0 and tan /3 = 30. Exclusion limits as a function of uiq and my 2 
are shown in Fig. 15 These limits are presented in Fig. 16 as a function of mg and mg, where mg 


is the average mass of the first-generation squarks. 

In Table |5j we summarize the exclusion limits from Figs. 12-15 


11 Summary 

A search for supersymmetry (SUSY) in hadronic final states characterized by large values of 
unbalanced transverse momentum has been carried out using a sample of i/s = 8 TeV pp col¬ 
lisions. The data were collected by the CMS experiment at the CERN LHC and correspond to 
an integrated luminosity of 19.5 fb -1 . An event selection based on the kinematic mass vari¬ 
able Mj 2 has been employed to reduce the background from standard model processes and to 
enhance the sensitivity of the search to a wide range of SUSY signatures. 

Two related searches have been implemented. The first is an inclusive search based on several 
signal regions defined by the number of jets and b-tagged jets, the hadronic energy in the event, 
and the value of the Mt 2 variable. The second is a search for events that contain a Higgs boson 
in the decay chain of a heavy SUSY particle. Assuming that this boson decays to a bottom 
quark-antiquark pair in accordance with the branching fraction of the standard model Higgs 
boson, this category of events has been investigated to seek an excess at 125 GeV in the invariant 
mass distribution of the selected b-tagged jet pairs. 

No significant excess over the expected number of background events has been observed, and 
95% confidence level exclusion limits on several SUSY simplified models and on the cMSSM/ 
mSUGRA model have been derived. Mass limits have been conservatively derived using the 
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Table 5: Summary of observed mass limits (at 95% CL) for different SUSY simplified models 
and for the cMSSM/mSUGRA model. The limits quoted are the observed limits using the 
signal cross section minus one standard deviation (<7 lheorv ) of its uncertainty For the simplified 
models, the limit on the mass of the parent particle is quoted for m^o = 0, while for the LSP 
the best limit on its mass is quoted. The best limit on the mass splitting between the parent 
particle mass and the LSP mass is also given. Finally, the absolute limits on the squark and 
gluino masses are quoted for the cMSSM/mSUGRA model. 


Simplified 

model 

Limit on parent particle 
mass at 77Z ~o = 0 

A1 

Best limit on 
LSP mass 

Limit on 
mass splitting 

Direct squark production 
Single light squark 

777g > 520 GeV 

ni~o > 120 GeV 

Am(q,*?) < 200 GeV 

8 degenerate light squarks 

777 q > 875 GeV 

o > 325 GeV 

A777 (q,^) < 50 GeV 

Bottom squark 

mg > 640 GeV 

m^o > 275 GeV 

a 1 

A 777 (b, V?) < 10 GeV 

Top squark 

m t > 777 1 + m^o 

m~y > 450 GeV 

777^0 > 60 GeV 

Am(t,£$) < 230 GeV 

m j < 77z t + m^o 

777-j: > 175 GeV 

rriTp o > 60 GeV 

AJ 

Am(t,x°) < 90 GeV 

Direct gluino production 

g -a qqvi 

mg > 1225 GeV 

777 > 510 GeV 

A1 

Am (g/Vi) < 25 GeV 

g -L bb $ 

77Zg > 1300 GeV 

m ~o > 740 GeV 

Am(g,x5) < 50 GeV 

g -L ttY? 

m g > 1225 GeV 

m? o > 450 GeV 

A1 

Am(g,x$) < 225 GeV 

gi ->■ qqr?, -l h 
g2 -L qq'tff , Xi -4- W**? 

777 g > 825 GeV 

m~a > 410 GeV 

A1 

Am(g,x^) < 225GeV 

cMSSM/mSUGRA model 

Mass limit for mg = mg 

Gluino mass limit 

Squark mass limit 


??7g,q > 1550 GeV 

777 g > 1150 GeV 

777 g > 1450 GeV 


theoretical signal cross sections reduced by one times their uncertainty (—ln t h e0 ry)- I n the 
context of simplified models based on pair-produced gluinos, each decaying into a quark- 
antiquark pair and a lightest SUSY particle (LSP) via an off-shell squark, gluino masses have 
been probed up to 1225-1300 GeV depending on the squark flavour. For the direct pair produc¬ 
tion of the first- and second-generation squarks, each assumed to decay to a quark of the same 
flavour and a light LSP, masses below 875 GeV have been probed under the assumption of eight 
degenerate light squarks. If only a single squark is assumed to be light, this limit decreases to 
520 GeV. For the direct production of third-generation squark pairs, each assumed to decay 
to a quark of the same flavour and a light LSP, masses up to 640 GeV for bottom squarks and 
450 GeV for top squarks have been probed. In the cMSSM/mSUGRA scenario corresponding 
to tan (6 = 30, Ao = —2 max(mo, ni \/ 2 ), and /z > 0, absolute mass limits have been found to 
be: 777 a > 1450 GeV, ?//g > 1150 GeV, and 77Zg = mg > 1550 GeV when equal squark and gluino 
masses are assumed. 
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